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Abstract. We analyze the drunkard's walk on the unit sphere with 
step size and show that the walk converges in order C/sin 8 steps 
in the discrepancy metric (C a constant). This is an application of 
techniques we develop for bounding the discrepancy of random walks on 
Gelfand pairs generated by bi-invariant measures. In such cases, Fourier 
analysis on the acting group admits tractable computations involving 
spherical functions. We advocate the use of discrepancy as a metric on 
probabilities for state spaces with isometric group actions. 



1. Introduction 

Fix 9 £ (0,7r). Consider the following random walk on the unit sphere 
S 2 in R 3 , whose steps are geodesic arcs of length 9. (Such arcs subtend an 
angle of 9 at the center of the sphere) . The random walk starts at the north 
pole, and at each step a uniformly random direction is chosen and the walk 
moves a geodesic distance 9 in that direction. We refer to this walk as the 
drunkard's walk on the sphere. 

The purpose of this paper is to develop techniques for bounding the dis- 
crepancy metric for random walks on Gelfand pairs, using the drunkard's 
walk as an example. Our bounds are sharp enough to give a rate of conver- 
gence. Let D(k) denote the discrepancy distance (defined later) between the 
k-th step probability distribution of the drunkard's walk and the uniform 
(rotation-invariant) measure on S 2 . We show the following: 

Theorem 1. For the drunkard's walk on the unit sphere S 2 with step size 
6, the discrepancy of the walk after k steps satisfies, for k = . ^ , 



0.4330 e~ c/2 < D(k) < 4.442 e" 



■C/8 



Thus order C / sin 2 9 steps are both necessary and sufficient to make the 
discrepancy distance of this walk from its limiting distribution uniformly 
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small. The result makes intuitive sense, since the number of steps to random 
should be large when 9 is close to or tt, and small when 9 is close to n/2. 
Moreover, if 9 ~ 1/n for large n, then this result shows that order n 2 steps 
are necessary and sufficient; this is similar to nearest-neighbor random walks 
on Z/nZ (e.g., see Q). We also note that given 9 this walk does not exhibit 
a sharp cutoff phenomenon. 

We frame our analysis in the context of a random walk on a homogeneous 
space, i.e., a space with a transitive group action. S 2 is a homogeneous space 
by the action of SO(3). Although the drunkard's walk is not generated by 
a group action, we show its equivalence with a walk that is. 

For random walks on groups, Fourier analysis is often used to obtain 
rates of convergence. On homogeneous spaces, we can lift the walk to acting 
group and do Fourier analysis there, although for non-commutative groups 
the group representations can be quite complicated. However, when the 
homogeneous space is a Gelfand pair (as in this case), Fourier transforms of 
bi-invariant measures and functions on the group simplify greatly, allowing 
for tractable computations involving the spherical functions. In our example, 
the generating measure on SO (3) can be made bi-invariant, and the spherical 
functions are Legendre polynomials. 

Much of the literature on random walks on Gelfand pairs is limited to dis- 
crete homogeneous spaces. Diaconis Q presents a survey and an annotated 
bibliography; applications include walks on subspaces of vector spaces over 
finite fields Jll| and walks on r-sets of an ra-set ||]. Rates are given in the 
total variation metric. 

Continuous examples have been addressed by Voit, who studied families 
of isotropic random walks on spheres 1 23 , ^] and other homogeneous spaces 
[ 24 1 . Central limit theorems are obtained using convergence in distribution 
or total variation as the dimension n — > oo. Such results differ from ours in 
that: (1) we work with a specific walk rather than a family, e.g., we obtain 
explicit bounds for a specific n rather than asymptotic results for large n, 
(2) we focus on rates of convergence of the walk on the homogeneous space, 
rather than convergence of a central limit theorem on the double coset space 
as m 

min, and ( 3 ) 

we use the discrepancy metric to measure convergence. 
We argue that it is a natural metric to use for walks on homogeneous spaces, 
and develop techniques to bound it. While we only illustrate our methods 
on the 2-sphere, similar methods can be used to give explicit discrepancy 
bounds for walks on high-dimensional spheres and other Gelfand pairs. 

For walks on continuous groups, we mention the work of Rosenthal IS] 
and Porod |T?], EM], who obtain total variation rates of convergence for ran- 
dom walks on SO{n) and other compact groups where the generating mea- 
sures are conjugate-invariant; this is another situation where the represen- 
tations simplify enough to get Fourier bounds. 

This paper is organized as follows. Section 2 gives background on the dis- 
crepancy metric and justifies its use over other common metrics on probabil- 
ities. Section 3 develops several equivalent formulations for the drunkard's 
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walk. Section 4 gives a formulation with a bi-invariant generating measure. 
This simplifies the Fourier analysis in Section 5, where matching upper and 
lower bounds for the convergence rate of the drunkard's walk are derived 
(Theorems ^ and ^). We summarize our methods for handling random 
walks on arbitrary Gelfand pairs in Section 6. 

For the uninitiated, Appendix A collects relevant background on Fourier 
analysis on groups, Gelfand pairs, and representations of 50(3) that are 
needed to make this paper self-contained. Appendix B contains proofs of 
technical results that are not central to the development of the ideas in this 
paper. 



2. The discrepancy metric 

Let (X, d) be a metric space with metric d. Given any two probability 
measures P,Q on X, define the discrepancy distance between P and Q by: 

D(P,Q)= sup \P{B)-Q{B)\ 

all balls B 

where a "ball" in X denotes any subset of the form {x : d(x,xo) < r} for 
some xq £ X and real number r > 0. It is easy to check that the discrepancy 
is a metric on probability measures. 

When X is the unit cube in R n and Q is Lebesgue measure on X, this 
definition reduces to the notion of discrepancy commonly used by number 
theorists to study uniform distribution of sequences in the unit cube (e.g., see 
[§, [R]]). Diaconis [f| was perhaps the first to suggest the use of discrepancy 
to measure rates of convergence of random walks. Su [20, 21, 22] explored 



properties of this metric and obtained sharp rates of convergence for certain 
random walks on the hypercube, circle, and torus. 

We shall be concerned with the case where X = S 2 and the metric on S 2 is 
inherited from its inclusion in R 3 . Thus balls may be visualized as spherical 
"caps" on the sphere. As noted later, the group of rotations 50(3) acts on 
S 2 in a natural way and the metric on S 2 is invariant under this action. 
Thus images of balls under this action are still balls, so the discrepancy 
metric on measures inherits this rotation invariance. 

Unlike the total variation metric (which is in more frequent use among 
probabilists), the discrepancy metric recognizes both the topology and the 
group action on the underlying space. For infinite compact state spaces this 
can be important. For instance, if P is a probability measure on S 2 sup- 
ported on a finite set of points, and Q is the uniform (rotation-invariant) 
probability measure, then the total variation distance between P and Q re- 
mains equal to 1 no matter how the points are arranged. On the other hand, 
the discrepancy D(P,Q) will capture how "well-distributed" the points in 
P are. Another example is a simple random walk on the circle generated 
by an irrational rotation (see which converges weak-* to Haar mea- 

sure; discrepancy captures this convergence, but total variation is blind to 
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it. Thus the discrepancy metric is well-suited to studying random walks on 
continuous state spaces generated by isometric group actions. 

We favor the use of discrepancy over other common metrics (e.g., the 
Prokhorov, Wasserstein metrics) because there are tractable bounding tech- 
niques for discrepancy involving Fourier coefficients. In fact, one of the main 
goals of this paper is to show that we can develop upper and lower bounds 
for discrepancy which give sharp rates of convergence in many cases because 
the dominant terms in each expression match. 

We remark that discrepancy bounds can be used to bound other metrics 
by exploiting known relationships between them for instance, the dis- 
crepancy is bounded above by total variation, so discrepancy lower bounds 
also offer a way to obtain total variation lower bounds. 

A property that will be needed later is that on groups, discrepancy de- 
creases with convolution: 

Theorem 2. If P,Q,i/ are arbitrary probability measures on a compact 
group G, then 

D{P*v,Q*v) < D(P,Q). 
Hence when Q = U , the uniform (Haar) measure, we have 
(1) D{P*u,U) < D(P,U). 



See [21] for a proof. 



3. The Drunkard's Walk and Equivalent Formulations 

The drunkard's walk on the sphere is not a random walk generated by a 
group action. However, we show in this section that it is equivalent to one 
that is, in the sense that the two random walks generate the same k-th step 
probability distribution even though their observed behaviors may appear 
quite different. 

Readers familiar with hypergroups may not be surprised by the equivalence 
and the ensuing analysis, since the associated double coset space of this walk 
is a commutative hypergroup, and much of our analysis can be framed in 
that language. We have avoided it; interested parties are referred to j|. 

Let N be the isotropy subgroup of SO (3) fixing n, the north pole. Let 
E C SO (3) denote the set of all rotations which fix a point on the equator 
and move the north pole by geodesic distance 8 along the surface of the 
sphere. Let Q denote the probability distribution supported on E that is 
left AMnvariant. 

Formulation 1. The Drunkard's Walk. This is the walk considered 
at the opening of this paper; a drunkard starts at the north pole and at each 
step picks a uniformly random direction and advances along the sphere in 
that direction by geodesic distance 9. 

Let Yfc for k = 0,1,2,... denote random variables which describe the 
location of the drunkard at time k. Thus Yo = n. If gi is an 50(3)-valued 
random variable with values in E and distribution Q, the position of the 
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drunkard at time k is given by = g±g2 ■ ■ ■ g k n. Thus, this walk is not 
a random walk in which the next position is generated by applying group 
actions to the current position in the walk. However, the following random 
walk is: 

Formulation 2. The Potted Plant. Consider a potted plant initially 
at the north pole. At each step, a rotation is chosen randomly from E 
according to Q and performed on the sphere. Thus the point currently over 
the north pole moves a distance 6 in any direction. This induces a motion 
of the potted plant, wherever it currently is. 

Note that with the given generating set E, the potted plant is moved a 
geodesic distance less than or equal to 6 at every step, since for each rotation 
in E, n is on the equator of the rotation axis and hence moves the farthest. 

If gi is a .SO^-valued random variable with distribution Q, the position 
of the potted plant at time k is given by = g k g k -\ • • • 9i n - Since the gi 
are independent and identically distributed, this shows that Formulations 1 
and 2 are equivalent and generate the same k-th step probability distribution 
on the sphere. This may be surprising in light of the fact that the steps of 
the random walk in Formulation 2 are smaller than in Formulation 1. 

The next random walk, while not essential in what follows, also generates 
the same k-th step probability distribution and we mention it for the sake 
of interest. 

Formulation 3. Rotate and Spin. Fix any rotation Rg which displaces 
the current north pole by geodesic distance 9. Start the random walk at the 
north pole, and at each step perform Rg followed by a uniform spin around 
the north-south axis. (The uniform spin moves the random walk to a random 
point anywhere on the same latitude.) 

Though Rg is not necessarily contained in the set E defined earlier, it 
does yield the same k-th step probability distribution as the previous for- 
mulations. This may be seen as follows. 

Consider the double coset space SO(3)/ /N. Each double coset is charac- 
terized by the latitude to which it sends the north pole. Thus Rg = n'gon" 
for some n',n" G N and a go G E. Then E = Ng$. Let n« denote an 
iV-valued random variable distributed according to Haar measure on N . 
The walk description shows that at the i-th step, riin'gon" acts on the ran- 
dom walk's current position. Therefore its position at time k is given by 
Y k = (n k n'gQn")(n k ^i n'g n") ■ ■ ■ (m n'g n")n. Since run'go and n"mn'g 
are identically distributed according to Q and n"n = n, the above random 
variable has the same k-th. step distribution as the other formulations above. 

Our original goal was to study Formulation 1, the drunkards' walk. Via 
the above equivalence we choose instead to study Formulation 2, because it 
is a random walk generated by a group action. However, the generating mea- 



sure Q, while left TV-invariant , is not bi-invariant. In light of Theorem 12, a 
bi-invariant generating measure would greatly simplify the ensuing Fourier 
analysis. (See Appendix A for background material on Fourier analysis on 
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compact groups and bi- invariant measures). In the next section, we remedy 
this problem by introducing a fourth random walk (Formulation 4) which is 
equivalent to Formulation 2 and whose generating measure is bi-invariant. 

4. A Bi-invariant Formulation 

We are interested in the discrepancy distance between the fc-th step dis- 
tribution of the drunkard's walk and Ug2, the uniform (rotation-invariant) 
distribution on S 2 . To simplify notation, we write 

(2) D(k) = D(C(Y k ),U s2 ) 

where £(Yfc) denotes the distribution of the random variable Y/% in Formu- 
lation 2. We investigate the behavior of D(k) as a function of the number 
of steps k. 

Recall that the homogeneous space 5 2 can be regarded as the left cosets 
of N in 50(3), so that the quotient map 50(3) — ► S 2 sends a rotation g 
to the point gn E S 2 . A random walk on S 2 generated by an 50 (3)-action 
(such as Formulation 2) may then be regarded as a random walk "upstairs" 
on 50(3) with an initial distribution Un, Haar measure on iV (which is the 
pre- image of the starting point n). The probability distribution upstairs 
evolves as usual for a random walk on a group, so that after one step the 
distribution is given by Q*Un and after k steps by Q* k *U]y. The probability 
of finding the original walk in a ball B C S 2 is the same as finding the lifted 
walk on 50(3) in B = BN C 50(3). Hence 

(3) D(k) = sup \Q* k * Un (B) — U(B)\. 

B 

where U is Haar measure on 50(3) and the supremum is taken over all B, 
pre- images of balls under the quotient map 50(3) — > S 2 . 

At this point we would appeal to Fourier analysis to deal with the con- 
volutions above. However, Q is left iV-invariant but not bi-invariant; recall 
that we desire bi-invariance to simplify the Fourier analysis. 

The following proposition shows that for random walks on groups, aver- 
aging the generating measure Q to make it bi-invariant will affect the rate of 
convergence in discrepancy by at most one step. This result is the analogue 
of a result of Greenhalgh [pH] , who obtained a similar result for the total 
variation distance. 

Proposition 3. Let Q denote any left N-invariant probability measure on 
a group G, let U and U n denote Haar measure on G and N respectively. If 
Q = Q * Un , then Q is N -bi-invariant and 

D(Q* k , U) < D(Q* k , U) < D{Q< h - l \U). 

Proof. Left invariance for Q means Un * Q = Q. We use this to establish 
bi-invariance for Q, which means Un * Q * Un = Q- This follows from 
Un * Q *Un = Un * (Q * Un) *Un = (Un * Q) * (Un *Un) = Q * Un = Q- 
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For the second assertion, note that 

Q* k = (Q*U N )* k = Q*(U N *Q)< k ~ 1 Ku N = Q* k *U N . 
Q* k = Q*(U N * Q)< k -V = (Q * U N )< k -^ * Q = Q< k - 1 ) * Q. 

Using Theorem | we obtain D(Q* k * U N , U) < D{Q* k , U) and D(Q< k ~^ * 
Q,U) < D{Q* (yk ^ l \U), which with the above equations yield the desired 
conclusion. □ 



Thus a random walk on a group with generating measure Q differs by no 
more than one step from a random walk proceeding according to Q, which 
may be viewed as the average of the measure Q over the left cosets of N. 

However, for a random walk on a homogeneous space, even more can be 
said if the walk begins at the point fixed by the isotropy subgroup: 

Proposition 4. Suppose X be a homogeneous G-space with isotropy sub- 
group N fixing xo £ X, and Q is a left-invariant probability on G. Let D(k) 
denote the discrepancy of the random walk starting at xq and evolving via 
a group action with elements chosen according to Q. Let D(k) denote the 
discrepancy of the random walk starting at xq, but evolving according to Q. 
Then 

D(k) = D(k). 

Proof. This follows from the fact shown in the previous proof, that 

Q*k = Q*k # Un _ 

The right side, when regarded as a measure on S 2 , describes the location 
of the Q-generated walk. But by the right invariance of Q, the left side is 
equal to Q* k * Un, which when regarded as a measure on S 2 , describes the 
location of the Q-generated walk. □ 

This shows that the following is equivalent to Formulation 2. 

Formulation 4. Let E denote the set of all rotations in SO(3) which 
move the north pole n by a fixed angle 0. Let Q = Q*Un be the bi- invariant 
generating measure obtained by averaging Q from Formulation 2. Consider 
a potted plant which starts at n and is moved according to the following 
rule: at each step, a rotation is chosen randomly from E according to Q and 
performed on the sphere. This induces a motion of the potted plant to a 
new location. 

Observe that we are able to throw extra rotations in "for free" and still 
obtain the same fc-th step probability distribution. This may be surprising 
because with the extra generating elements the step size of the potted plant 
is no longer bounded by 8, as it was in Formulation 2. In fact, the potted 
plant could be moved around rather wildly at each step. 
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Exploiting this equivalence, we shall, in the sequel, work with Formulation 
4. To save notation we write Q for the bi-invariant measure Q. Right- 
invariance for Q yields Q = Q * Un, which when substituted into (|3|) gives 

(4) D(k) = sup\Q* k (B)-U(B)\. 

B 

where the supremum is taken over all ball pre-images B. Hence, the dis- 
crepancy D(k) as defined in (S) can now be analyzed using expression (Q). 

5. A Rate of Convergence 

We now proceed to derive a rate of convergence for the drunkard's walk 
on the sphere. Several calculations require the facts reviewed in Appendices 
A and B; we alert the reader with references. 

Let By tr denote a ball of geodesic radius r centered at y G S 2 . Such balls 
look like spherical "caps" on S 2 . Let B y ^ r denote its pre-image "upstairs" 
in 50(3). To reduce notation, write B r = B n ^ r for the pre-image of a ball 
centered around n. Let 5 r denote the indicator function of B r on S0(3). 

A key observation (see Q) for evaluating measures on balls is they can 
be regarded as convolutions with indicator functions on those balls, i.e., for 
any right-invariant measure v on SO (3), 

(5) v(By )r ) = v * 5 r (y) 

for any y £ yN. This follows from v * S r (y) = f ge § dv(yg~ l ) = v(y ■ B r ). 
From (El) we have 



D(k) 



(6) 



sup 

V,r 

sup 

y,r 



Q* k {By,r) — U (By,, 

Q* k * S r (y) - U(B y>r 



We wish to use Fourier inversion to derive bounds for these expressions 
in terms of the Fourier coefficients. We need continuity of Q* k *5 r for k > 2: 

Proposition 5. Let Q be defined as in Formulation 4, and let 8 r be denote 
the indicator function of B r . Then Q* k * 5 r is continuous for k > 2. 

This is proved in Appendix B. 

Hereafter, assume k > 2. We shall also assume for the moment that 
Q* k * 5 r has an absolutely convergent Fourier series, which will be verified 
later in the course of our computations. Since Q* k * 5 r is a continuous 



function for k > 2, it is exactly equal to its Fourier series (Theorem 11), so 
that from (|6|) and (|i"6| ) we have 



D(k) 



sup 

y,r 



^(2n + 1) Tr \Q k ( Pn ) 5 r (p n ) p n (y) 



n=l 

oo 



(7) 



< supj^(2n + l) 



n=l 



Tr 



Q k (pn) $r(Pn) Pn(y) 
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where p n is the irreducible representation of 50(3) of dimension (2n + 1). 
The trivial representation po does not appear here since it was cancelled in 
(|) by U(B y>r ). 

Remark 6. Since Q and 5 r are both iV-bi-invariant on 50(3), by Theorem 
12 there is a basis for the representations such that their transforms are iden- 
tically zero except in the (1, l)-th entry. Any such basis (e.g., the spherical 
harmonics) has its first basis element given by the Legendre polynomials, 
which are the spherical functions for the Gelfand pair (50(3), N). 

Hence p n {y)(i,i) = -Pn(cos7), where 7 is the geodesic distance of y from 
n. Since the product of the transforms of Q k and S r are identically zero 
except for the (1, l)-th element, the only diagonal element changed by mul- 
tiplication by p n (y) is the (1, l)-th entry. Hence the trace (0) reduces to 

00 

D(k) < sup > y (2n + l) Q k (pn)(i,i) &r{Pn)(lX) -Pn(cos7) 



n=l 



(8) 



< SUp^(2n + l) Q k (Pn) (1,1) K-(Pn) (1,1) 



n=l 



where the second inequality follows from ([22]) . Notice that the sum in (g) is 
precisely the sum in Theorem [ll| that needs to be checked for convergence 
in verifying that Q* k * 5 r has an absolutely convergent Fourier series. Hence 
when we bound the above expression we will also have validated our use of 
Fourier inversion in our computations. 
From (|20|), we have 

(9) Q(Pn)(l,l) = Pn(cOS0) 

since P n is constant on the support of Q. Also, for a ball B r of geodesic 



radius r and n > 1, formula (21) gives 



$r(Pn)(l,l) 



P n (x) dx 



|P n _i(cosr) - P n+ i(cosr)| 



2(2n + 1) 



(10) 

(11) < — ^- . 

v ' ~ 2n + l 

The integral of P n follows from ( |23"| ) and noting that P n (X) 
inequality follows from (p2]). 

Substitution of @ and (^) into (§) yields 



1, and the 



(12) 



D(k)<J2\Pn(cos9)\. 



n=l 



To bound the Legendre polynomials, we use the following well-known 
bound (see Jackson |L4], p.63]): 
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Proposition 7. For P n , the n-th Legendre polynomial, and any 9, 

\p n {^e)\ 2 < — 

im sin 9 

We derive an alternate bound, suitable for small 9: 

Proposition 8. For P n , the n-th Legendre polynomial, and nsin 2 9 < .9, 

i d ( mi2 ^ 1 ™sin 2 6> 
\P n (cos0)f < 1 . 



This bound is better than Proposition ^ when n sin 2 9 < 2 — y4 — ^ ~ 
.794. It is proved in Appendix B. Using Propositions and || and the bound 
1 — x < e~ x , the sum in (^) can be estimated: 

oo , s fc/2 

E^)l < E + E (^fl) 

1=1 n<B/ sin 2 6 n>B/sin 2 6» 

e -fe sin 2 6»/8 / 2 \ fc / 2 1 



l_ e -fesin 2 e/8 V^sin 2 ^ ^ , n fe / 2 

n>B/ sin 2 



E 



where -B = .9. Note that 



1 f°° dx ; /sin 2 ' 

2^- „fc/2 - / R Ife/2 r 



^jfc/2 J g x k/2 y 

n>S/sin 2 6 l sin 2 o 

sin 2 6>\ 1_1 /sin 2 #^ 



+ 



k-2\ B ) \ B 

Thus 

e -fesin 2 e/8 ( 2 \ k/2 ( 2B \ 

El P "^)l ^ T.g-fcsin 2 ^ + (j^J V(A;-2)sin 2 +1 J ' 

Note that (4?) 1/2 < e" 1/8 < e _sin2e / 8 . For fc = -5- and C > 4, one sees 
V ttB I sir 9 — ' 

that (fc - 2) sin 2 9 > 2 and k > 4, so that 

fX(z)| < e- fe - 2 ^ (73^72 + * + 

n=l 



< 4.442 e~ c/8 . 



The above bound, together with (|12D, proves the following theorem. (Note 
that the C > 4 restriction above is not needed below because the discrepancy 
D{k) never exceeds 1.) 

Theorem 9. For the drunkard's walk on the sphere with step size 9, the 
discrepancy after k steps satisfies, for k = , 

D{k) < 4.442 e~ c/8 . 
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Thus order 



C 



steps are sufficient to make the discrepancy uniformly 



small. The following lower bound confirms the order is correct. 

Theorem 10. For the drunkard's walk on the sphere with step size 9, the 
discrepancy after k steps satisfies, for k > 2, 

D(k) > — \cos9\ k . 
- 4 

For k = c -> _ , we have 

din' 1 H ' 



D{k) > 0.4330 e 



-C/2 



Thus order 



J=jr§ steps are needed to make the discrepancy distance uni- 
formly small. Together, Theorems |9| and |K] prove Theorem ||. 

One way to obtain a lower bound for discrepancy is to evaluate the dif- 
ference of Q* k and U on well-chosen ball. The same idea can be used for 
the total variation; one way to choose such a ball (see [Q, p. 29]) is to take 
a set cut out by a random variable consisting of the dominant terms in the 
Fourier series of Q* k . The mean and variance of the random variable and 
an appeal to Chebyshev's inequality yield an estimate for Q* k on that set. 



However, the proof of Theorem 10 illustrates a different approach using 



ideas similar to those used in [22] for bounds on the torus. We construct a 



"local discrepancy" function which at each point evaluates the discrepancy 
of the measure on a set of geodesic radius r centered at that point. The 
function is bounded above by the total discrepancy. As before, it can be 
rewritten in terms of a convolution of the original measure and the indicator 
function of the set. An appeal to PlanchereFs identity gives a sum with only 
non-negative terms, so the dominant term can be pulled out as a lower bound 
for discrepancy. 

We remark that since discrepancy is a lower bound for total variation, 
this lower bounding technique can also be used to obtain lower bounds for 
random walks under total variation. 

Proof. Define, for g € SO (3), 

Arfo) = Q* k (By,r) ~ U(B y>r ) 

where y is the image of g under the quotient map from SO{2>) to S 2 . From 
(0), we see that A r (x) < D{k), and hence for all r, 



(13) 



50(3) 



A 2 r (g) dp < D(kf 



On the other hand, Plancherel's identity on SO(3) |], p. 256] yields 



(14) 



50(3) 



A 2 .(g) dp = Y^{2n + 1) Tr \A r (p n )A r (p n 



n=0 
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where the * denotes the conjugate transpose (here only). Notice that A r 
may be rewritten as: 

Ar(x) = Q* k * 6 §r (x) - U * 6 §r (x) = (Q* k - U) * 5 &r (x). 

Then A r may be computed as A r (p n ) = ( Q k {p n ) — U(p n ) ) 5g (p n )- 

For n = 0, A r (p ) = since Q k (p ) - U(po) = 1-1 = 0. 
For a trivial computation shows U(p n ) = 0, and thus 

A r (p n ) = Q k {p n ) $B r (Pn)- 

Remark || and the computations from Equations @ and (|l(]) , when substi- 
tuted into (14), and combined with (13), give 

(15) D(kf > f> + 1) ( P -'" :OS 2 [ 2 ) „" + P : ) +l(C ° Sr) ) 2 

where r may be chosen arbitrarily. Taking only the dominant term (n = 1) 
in the above expression, and letting r = ir/2, we have cosr = 0, Pq(0) = 1, 
p 2 (0) = -1/2, and Pi{x) = x. It follows that 

D(k) > — \cose\ k 
4 1 1 



as was to be shown. The second inequality in the theorem follows from 
| cos B\ k = e l lncos2e > e-1 sin2e , using the fact that ln(l - x) > -x for all 
x. □ 



For a tighter lower bound, one may use more terms in ( |i5[ ) and adjust the 
choice of r; however, the dominant term sufficed to obtain matching upper 
and lower bounds for this random walk. 



6. Conclusion 

A similar analysis can be carried out for the discrepancy convergence of 
any random walk on a Gelfand pair, when the spherical functions are known. 
Proposition ^| shows that making a generating measure bi-invariant will not 
affect the rate of convergence. The upper bound is obtained via (|5|), Fourier 
inversion to yield (|l^), and bounds on the appropriate spherical function 
(e.g., Prop. ^). The lower bound is obtained via Plancherel's identity applied 
to the square of the local discrepancy function, e.g., equations (|l3|) and (|l4|) , 
then choosing as many terms as needed. 

We remark that our pair of strategies often works well for obtaining 
matching upper and lower bounds because if there is a dominant Fourier 
coefficient, it appears to the same order in both upper and lower bounds. In 
our example, Q(l) was the dominant term; compare the upper bound (HD 



and lower bound (|15|). See 12, 21, 22 1 for more examples of this phenomenon 



in discrepancy bounds for random walks on groups. 
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Appendix A. 

This appendix reviews material on harmonic analysis, homogeneous spaces, 
Gelfand pairs, representations of 50(3), and Legendre polynomials. 

Fourier Analysis on a Compact Group. A standard reference is the 
encyclopedic account by Hewitt and Ross |l3|]. Diaconis H gives a concise 
introduction to Fourier analysis on finite groups. Dym and McKean |J is a 
readable introduction to Fourier series on 50(3). 

We assume henceforth that all compact groups are separable and metriz- 
able. For any compact group G there is a unique measure p on G, called 
(normalized) Haar measure, such that p is G-invariant and p(G) = 1. 

Let V be a finite dimensional vector space over C, the complex numbers. 
Recall that a representation of a group G on V is a homomorphism p : G —* 
GL(V). If V has dimension n, then p is said to have dimension n. A basis for 
V can be chosen so that the image of p with respect to this basis are unitary 
matrices. If there is no non-trivial subspace of V invariant under the action 
of G, then p is said to be irreducible; otherwise p decomposes as a direct sum 
of irreducible representations. (One can similarly define a representation p 
of G on a Hilbert space, though if G is compact p decomposes into a direct 
sum of unitary representations of finite dimension.) 

Two representations p on V and p' on V are equivalent if there is an 
isomorphism r : V — ► V such that r o p = p' o r. Let £ denote a the set of 
equivalence classes of irreducible representations of G. For a compact group, 
£ is countable and furthermore, all the irreducible representations are finite 
dimensional. 

Definition 1. The Fourier transform of a complex- valued function / on a 
compact group G at a representation p of G is defined by 

7(p) = / /G?) P^ 1 ) dp 

JgeG 

where p is Haar measure on G. 

Similarly, the Fourier transform of a measure v on G at p is defined by 

= / Pig' 1 ) dv{g). 

JgeG 

We show how a function may be recovered from its Fourier transforms at 
irreducible representations. Let d p denote the dimension of a representation 
p. For any operator A, let Tr[A] denote the trace of A, and let ||A|| ¥ , 1 denote 
the sum of the eigenvalues of the operator square root of AA*. (Here, * 
denotes conjugate transpose.) 

Definition 2. For any / G L l (G,p), the series 

(16) ^d p rr[/(p)p( 5 ) 

pes 
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is called the Fourier series of /. (There is mild abuse of notation here: 
by p £ S, we really mean to choose a representative p from each class of 
irreducible representations in E.) If 

(17) £dpll/(P)ll*>i<°°i 

pes 

then / is said to have an absolutely convergent Fourier series |l3|, (34.4)]. 

Theorem 11 (Fourier inversion). If a function f on G has an absolutely 
convergent Fourier series, then the Fourier series of f{g) converges uni- 
formly to a continuous function f{g), and f(g) = f(g) almost everywhere 
on G with respect to Haar measure p. 



Proof. This theorem is embedded in Hewitt and Ross [|13| , but obscured by 
their exotic notation. We briefly indicate how to "prove" this theorem from 



results cited in [13|. 



The set of functions with absolutely convergent Fourier series is denoted 
in |n| by a symbol that resembles 1Z(G), defined in (34.4). Theorem (34.6) 
in [|l3[ shows that any / £ 7&(G) is equal almost everywhere to its Fourier 
series. Theorem (34.5.ii) shows that this Fourier series converges uniformly 
to a continuous function that we have denoted /. □ 



We remark that since the notation in Hewitt and Ross |13|] is cumbersome 
and tough to wade through, for the sake of probabilists we have simplified it 
by following the notation of Diaconis Q . To aid the reader wishing to follow 
the results quoted above, we provide a "dictionary" between the two sets 
of notation: in Hewitt and Ross (27.3)], a denotes a class of equivalent 
irreducible representations in £ and U is a representative of that class; we 
avoid reference to a (to eliminate an unnecessary layer of notation) and use 
p instead of U. Hewitt and Ross denote an arbitrary element of a group G 
by x £ G; we use g £ G. Their notations A a and U% refer to operators that 
correspond to our f(p) and p(g), respectively (see |13|, (34.1. i), (34. 4. a)]). 

Note that if / is continuous, Theorem [TT] implies that if / has an absolutely 
convergent Fourier series, it equals its Fourier series at every point. 

Homogeneous Spaces and Gelfand Pairs. Diaconis |4], Chap. 3F] pro- 
vides an introduction to Gelfand pairs on finite groups and an annotated 
bibliography. Dieudonne is a concise introduction to Gelfand pairs on 
compact and locally compact groups. 

Definition 3. Let Gbea compact group and X be a topological space. An 
action of G on X is a continuous mapping from G x X — > X denoted by 
(s, x) i— ► s ■ x = sx such that id - x = x and s ■ (t ■ x) = (st) ■ x. 

If G acts transitively on X, that is, if for any x,y £ X there exists an s 
such that sx = y, we call X a homogeneous space. 

Given a point xo £ X, let N denote the isotropy subgroup of G with 
respect to xq, i.e., the set of group elements which fix xq. By construction, 
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N is a closed subset of G. The canonical isomorphism of X onto G/N, the 
left cosets of N, respects the action of G. Thus g : xN i— > (gx)N. 

Let fix denote the G-invariant measure on X induced by Haar measure 
on G. Let L 2 (X) denote the space of all complex-valued square-integrable 
functions on X with respect to fix ■ The action of G on X induces an action 
of G on L 2 (X) by g ■ f(x) = f(g~ 1 x). This action is a 1-to-l linear mapping 
of the vector space L 2 (X) into itself and so defines a representation of G. 

Definition 4. A function / on G is said to be N -bi-invariant if f(n'gn") = 
f(g) for all n',n" £ N and g 6 G. A measure v on G is N -bi-invariant if 
for any measurable set A in G, v{n' An") = v{A) for all n',n" £ N. Thus v 
satisfies v * fi = fi* v = v where fi is Haar measure on G. 

In this paper bi-invariance on a homogeneous space G will be understood 
to mean with respect to the isotropy subgroup N . Note that bi-invariant 
functions on G are constant on double cosets NgN and may therefore be 
viewed as functions on the double coset space (denoted G//N), or as left- 
invariant functions on X via its isomorphism with G/N. 

Definition 5. The pair (G, N) is called a Gelfand pairii the convolution al- 
gebra L 2 (G//N) of 7V-bi-invariant functions is commutative. We sometimes 
say X = G/N is a Gelfand pair when G is understood by context. 

The next fact about Gelfand pairs is the most important for our purposes. 
A similar result for the finite group context may be found in [Q, p. 54]. 

Theorem 12. // (G, N) is a Gelfand pair, then for every irreducible repre- 
sentation p : G — » GL(V) there is a basis of V such that for all functions 
f (resp. measures v) bi-invariant with respect to N , the Fourier transform 
f{p) (resp. v{p)) in that basis contains only zeroes except possibly for the 
(1, l)-th entry. 

Proof. Dieudonne (22.5.6)] shows the algebra L 2 (G/ /N) is commutative 
if and only if the number of times the trivial representation appears in p\n, 
the the restriction of p to N, is zero or one. 

If one, this trivial representation corresponds to a one-dimensional sub- 
space of V fixed by N, i.e., the left iV-invariant functions on X; choose the 
unique function s(x) on X normalized so that s(xq) = 1. This is sometimes 
called the spherical function of (G, N) corresponding to the representation p. 
Complete s to a basis for V so that the matrices of pi at break into irreducible 
"blocks". Then for a right iV-invariant function /: 



(18) 



fip) 




16 



FRANCIS EDWARD SU 



where is Haar measure on the subgroup N. The second equality is 
obtained by choosing a coset representative x from each coset in G/N and 
expressing g = xn for some x and n G N, and noting that Haar measure \i 
decomposes as a product measure px "MAT- A similar argument holds for a 
right-invariant measure v, noting that v decomposes as a product measure 
vx ■ Pn because of right-invariance. 

By the orthogonality relations for matrix entries || (21.2.5.c)] of irre- 
ducible representations of N, the left-most integral of ( pj| ) produces a matrix 
consisting of zeroes except possibly for the (1, l)-th entry. Thus f(p) (resp. 
v(p)) has zero entries except possibly for the first row. A similar argument 
using the left-invariance of / (resp. v) shows that f(p) (resp. v{p)) has 
zero entries except possibly for the first column. Together, these statements 
imply that the only entry that could possibly be non-zero is the (1, l)-th 
entry. 

If the trivial representation does not appear in the the restriction of p to 
N, the argument above holds by ignoring the role of s when choosing a basis 
for V. Orthogonality then shows that the left-most integral of (^) yields a 
zero matrix. □ 

There is thus one spherical function Si(x) for every irreducible represen- 
tation pi appearing in L 2 (X). These induce iV-bi- invariant functions s, on 
G. In the theorem above the Si(g) appears as the (l,l)-th entry of Pi(g) 
for an appropriate basis. Hence for any measurable function / on G, the 
(1, l)-th entry of the Fourier transform at pi satisfies 



The sphere as a Gelfand pair. The rotation group SO(3) acts on the 
unit sphere S 2 by the natural inclusion of S 2 in R 3 . This action is clearly 
transitive on S 2 , so S 2 is a homogeneous space. In fact arises from the 
Gelfand pair (G,N), where G is the rotation group 50(3), and ./V is the 
isotropy subgroup of rotations fixing n, the north pole. By restriction to 
the plane orthogonal to n E R 3 , we see that N is isomorphic to the group 
SO(2). The sphere S 2 may then be regarded as the space SO(3)/SO(2). In 
fact, for all n > 2, Dieudonne § shows that S n = SO(n + l)/SO{n) is a 
Gelfand pair. 



(19) 




Similarly, for a measure v on G 



J g eG 

Dieudonne Q is a readable introduction to the general theory of spherical 
functions; Letac computes them in several examples. 




Representations of S0(3). For a good reference on representations of 
S0(3) and other compact Lie groups, see Brocker and torn Dieck ||. 
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Let A = -k-^t + + •5ht be the Laplace operator on R 3 . The harmonic 

OXf OX2 OX3 r L 

polynomials are the set of all complex-valued homogeneous polynomials / in 
xi,X2,X3 of degree n such that Af = 0; the restrictions of these functions 
to the sphere S 2 form a set V n , the spherical harmonics of degree n (one of 
which is the spherical function s n ). 

The action of SO (3) on V n is induced by its action on R 3 in the manner 
described earlier: g ■ f(x) = f(g~ 1 x). Moreover, V n is irreducible and finite- 
dimensional, and every irreducible representation of SO(3) arises in this way. 
The dimension of V n is 2n + 1. 

Legendre polynomials. The spherical functions s, on S 2 are given by 
the well-known Legendre polynomials Pj in the following way: for y £ S 2 , 
Si{y) = Pi(x) where x = cos8 y £ [—1,1] and 9 y is the geodesic distance 
between y and n on S 2 . Just as S 2 is (isomorphic to) the left cosets of N 
in G, the set [—1,1] is the double coset space of this Gelfand pair. 

Since Haar measure on SO (3) induces the uniform (rotation-invariant) 
probability measure on S 2 and uniform probability measure on [—1,1], we 



can compute (19) as 



1 



(21) [/(Pn)](l,l) = / f(9)U9) <*M = 5 / f(x)P n (x) dx 

JSO(3) 1 J-l 

where fi denotes normalized Haar measure on SO (3), dy denotes the uniform 
measure on S 2 , and dx is Lebesgue measure on R. See |9], p. 239]. 
The Legendre polynomials have the generating function 



i=0 



' 11 (1 - 2xr + r 2 )V2 



and the first few are: Pq(x) = l,Pi(x) = x,P2(x) = \{3x 2 — l),Ps 
^(5x 3 — 3x). Furthermore, for all n > 0, PniX) = 1 arid 

(22) |P n (x)| < 1 

for x G [—1, 1]. The following identity will be needed later. For n > 1, 

-fn+i( x ) — -^n-l( x ) 



(23) P n (x) 



2n + l 



This follows from the generating function for P n [x). A nice account of these 
and other properties of Legendre polynomials may be found in Jackson |l4|] . 



Appendix B. 



This appendix contains the proofs of some technical results (Proposition 
H and Proposition ||) that are not central to the development above. 
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Proof of Proposition To show that Q* k *5 r (x) is continuous for k > 2, 
we first require the following technical lemma. 

Lemma 13. Let v be a positive measure on a compact metric group G, and 
let f be any measurable, bounded function f with discontinuities on a set 
Df. Let xDj 1 denote the set {xd~ l : d G Df}. Given x, if ^(xDj 1 ) = 0, 
then the convolution 



h{x) = v * / (x) = I f(z 1 x) dv{z) 

is continuous at x. 



G 



Proof. To show h{x) is continuous at x, consider any sequence x n € G such 
that x n — ► x. It must be shown that h(x n ) — ► h(x). 

Let w n {z) = f{z~ 1 x n ) and w(z) = f(z~ 1 x). Since / is bounded, all the 
w n and w, being translates of /, are uniformly bounded by some constant 
function. This constant function is in L 1 (G, u), since G is compact. 

Also, w n (z) — > u>(z) pointwise for all z xD^ x , since / is continu- 
ous at those points. By the assumption on Df we have pointwise con- 
vergence almost everywhere. By Lebesgue's dominated convergence the- 
orem, J G w n (z) du(z) — > j G w(z) du(z), which is precisely the statement 
h(x n ) — > This completes the proof of Lemma 13. □ 

We can now prove Proposition |5[ 



Proof. We apply Lemma 13 setting /(x) = <5 r (x) and v = Q. By inspection, 
S r is bounded by 1. The lemma implies Q * 6 r (x) is continuous everywhere 
except possibly at x = id. This may be seen by observing that the disconti- 
nuity set of S r is dB r , the boundary of B r . This is the pre-image of a circle 
on S 2 . On the other hand, Q regarded as a measure on S 2 is uniformly sup- 
ported on a circle at latitude 9 from the north pole. Any two circles on S 2 
intersect in at most two points, unless they are identical. Hence Q(dB r ) = 
unless the support of Q intersects dB r , which only occurs when B r = B n g. 
This corresponds to a discontinuity in Q * 5 r (x) at x = id when r = 8. 

We now apply Lemma |L| again to show that Q* k * 5 r (x) is continuous for 
k = 2. The preceding observations show that Q * 5 r (x) is continuous almost 
everywhere (except possibly at the identity which is not in the support of 
Q). It is bounded by 1. Applying the lemma for f(x) = Q*5 r (x) and v = Q 
shows that Q* 2 * S r (x) is continuous everywhere. 

Now proceed by induction on k. For k > 3, let v = Q and let f(x) 



Q*( k x ) * 5 r (x), which is continuous. Then Lemma 13 shows that v * f(x) = 
Q* k * S r (x) is continuous. □ 

Proof of Proposition [8]. This proves a Legendre bound for small 9. 

Proof. Let x = cos 6. From O, p. 62], 

2 r /2 



P n (x)\ < - r e ~ nz2w2 / 2 dw 

7T Jo 
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for z = |(1 - x 2 ) 1 / 2 . Substitute t = n l / 2 zw, and set A = n 1 / 2 zir/2 
n 1//2 sin#. Obtain 



\P n {x)\<±J e~ t2 ' 2 dt. 



To estimate this integral, square both sides and consider the double inte- 
gral over a square in first quadrant of the plane, and then change to polar 
coordinates: 



\P n {x)\ 2 < ^J q A J^e^-^dh dt 

J<f>=0 Jr=0 
2 />7r/4 



r A/cos \-^rdr 



1 - e 



-A 2 /2cos 2 <t>\ 



For 0<^><7t/4, y = 4r sec 2 4> < A 2 which by assumption is less than .9. 
Now for y < .9, the inequality 1 — e~ y <y — holds, and yields 



2 f 71 "/ 4 / A 2 
Integrating the right hand side gives 



3^ 
~32 



sec 



tan 6 — 



3 A 2 /tan 3 , 



16 V 3 



tan ( 



t/ 4 ^2 



□ 
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